Back

Nature Human Behaviour

Springer Science and Business Media LLC

Preprints posted in the last 7 days, ranked by how well they match Nature Human Behaviour's content profile, based on 85 papers previously published here. The average preprint has a 0.16% match score for this journal, so anything above that is already an above-average fit.

1
Multi-ancestry genome-wide association study and meta-analysis of stimulant use disorder reveals biology and relationships to other psychiatric disorders

Beck, S. E.; Deak, J. D.; Levey, D. F.; Ge, T.; Jeffries, P. W.; Lai, D.; Mallard, T. T.; Degenhardt, L.; Lind, P. A.; Tollerup Nielsen, T.; Tubbs, J. D.; Wetherill, L.; Johnson, E. C.; Hatoum, A. S.; The SUD Working Group of the Psychiatric Genomics Consortium, ; COGA Collaborators, ; Yale-Penn Collaboration, ; The VA Million Veteran Program, ; Borglum, A.; Demontis, D.; Medland, S. E.; Martin, N. G.; Nelson, E. C.; Smoller, J. W.; Kranzler, H. R.; Gaziano, J. M.; Stein, M. B.; Agrawal, A.; Edenberg, H. J.; Gelernter, J.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.05.26354997 medRxiv
Top 0.1%
13.9%
Show abstract

Stimulant use disorder (StimUD) is a significant public health problem, but genetic studies have been limited by small sample sizes. We conducted genome-wide association studies (GWAS) of StimUD in the Million Veteran Program (MVP) and All of Us (AOU), followed by meta-analysis with FinnGen and 10 additional datasets, for a total of 709,369 individuals (Ncases=33,977, Ncontrols=675,392) in four broad ancestry groups: European (EUR) (Ncases=22,564, Ncontrols=624,672), African (AFR) (Ncases=7,574, Ncontrols=34,189), Admixed American (AMR) (Ncases=3,657, Ncontrols=15,698), and East Asian (EAS) (Ncases=182, Ncontrols=833). Population-specific SNP heritability was 6.1% in EUR and 2.4% in AFR. We discovered a total of 19 genome-wide-significant loci, six in EUR, including DRD2*rs5794864, P=7.32E-10, one in AFR, five in a multi-ancestry meta-analysis, including CHRNA5*rs55781567, P=3.27E-9, two in a male-only meta-analysis, including FTO*rs8057044, P=9.50E10-9, and five in a meta-analysis of sex-stratified results. In a hold-out AOU subsample (NEUR=18,841, NAFR=12,263, NAMR=9,739), ancestry-specific polygenic risk scores were significantly associated with StimUD in EUR (OR=3.28, 95% confidence interval (CI)=2.89-3.71) and AMR (OR=2.01, 95% CI=1.71-2.37). Transcriptome-wide association studies, fine-mapping, and colocalization analyses prioritized additional genes (e.g., GPX1, BSN). Genetic correlation, Mendelian randomization, and causal mixture analyses revealed relationships with other substance use and use disorder phenotypes, including cannabis use disorder (rg=0.94, P=5.43E-237) and opioid use disorder (rg=1.01, P=4.40E-107), and other psychiatric traits, including anxiety, depression, neuroticism, and attention-deficit/hyperactivity disorder. This is the first well-powered GWAS of StimUD, and it offers significant insights into disease biology.

2
Cultural engagement and mental disorders: A prospective negative control analysis of the English Longitudinal Study of Ageing with linked Hospital Episode Statistics

Qin, P.; Steptoe, A.; Fancourt, D.

2026-06-08 epidemiology 10.64898/2026.06.05.26354991 medRxiv
Top 0.3%
7.0%
Show abstract

Cultural engagement is associated longitudinally with better mental health and reduced depression incidence, but evidence has largely relied on self-reported symptoms and diagnoses, leaving uncertainty about clinically recorded disorders, and residual confounding remains a concern. Here, we examined whether cultural engagement (including going to cinemas, museums, galleries, exhibitions, theatre, concerts, or opera) predicts hospital-treated mental disorders in 8,274 adults aged 50 years or older from the English Longitudinal Study of Ageing. Participant records were linked to ICD-10 diagnoses in Hospital Episode Statistics and mortality records with follow-up of up to 20 years. In fully adjusted Cox models accounting for sociodemographic, lifestyle, and social factors and multiple testing, frequent cultural engagement was associated with lower risk of any mental disorders (HR 0.71, 95% CI 0.62-0.82, FDR adjusted P value<0.001), dementia (0.71, 0.56-0.89, FDR adjusted P value=0.010), substance misuse (0.75, 0.59-0.95,FDR adjusted P value=0.040), and mood disorders (0.73, 0.56-0.95, FDR adjusted P value=0.044), but not neurotic disorders. Associations persisted after excluding early incident cases and adjusting for baseline depressive symptoms and cognition, and showed robustness to unmeasured confounders. To further probe causality, eye disease, ear disease, and traumatic brain injury, which share similar socio-demographic profiles to mental disorders, were prespecified as negative control outcomes. Cultural engagement was not associated with any negative control outcomes. These findings provide triangulated statistical data to suggest that cultural engagement is associated with reduced risk of several clinically recorded mental disorders and support further testing of cultural engagement as a population mental health strategy.

3
Surfacing Suicidal Risk Through Simulated Social Interaction: Per-Person Language Model Agents as Communicative Stress Tests

shao, w.; Ammerman, B.; Jacobucci, R.

2026-06-06 psychiatry and clinical psychology 10.64898/2026.06.04.26354928 medRxiv
Top 0.7%
4.2%
Show abstract

Suicidal risk may be encoded in everyday communication patterns but diluted in routine digital interactions. We introduce a method for surfacing this latent signal: training per-person language model agents on individuals' authored text (the on-screen text each participant typed, captured whenever a keyboard was visible in screenshots) and placing those agents in simulated social interactionsa communicative stress test. Using data from 79 adults with recent suicidal ideation, we ne-tuned individual LoRA adapters on Qwen3-8B using each participant's authored text, then placed agents in standardized conversations with probe personas. Agent-generated risk language was associated with EMA-measured suicidal ideation (r= .576, p < .001), with a single neutral small-talk probe performing nearly as well (r= 551). A shue control conrmed the signal is person-specic (r= .071 when adapters were mismatched), and automated descriptions of participants' general smartphone activity produced no signal, conrming specicity to interpersonal communication. A prompt ablation demonstrated partial robustness to removal of disclosure-encouraging language (r = .430). This proof-of-concept demonstrates that simulated social interaction can amplify latent vulnerability signals, bridging digital phenotyping, generative AI, andsuicide theory.

4
Snip Happens: A Retrospective Study of Vasectomy and Birth rates in Australia

Janetzki, J.; Modi, N.; Varney, B.; Pratt, N.; Ward, M.; Wiese, M.; Lim, R.; Kalisch Ellett, L.

2026-06-05 sexual and reproductive health 10.64898/2026.06.03.26354864 medRxiv
Top 0.7%
4.0%
Show abstract

Background Fertility rates in Australia have been declining over recent decades, reaching a record low total fertility rate of 1.48 births per woman in 2024. Concurrently, vasectomy remains widely accessible and increasingly normalised as a permanent contraceptive option. Despite extensive commentary on falling birth rates, no contemporary Australian study has examined vasectomy rates relative to birth rates over time. We aimed to compare population level vasectomy and birth rates across Australian jurisdictions and age groups. Study design Nationwide retrospective time-series study. Retrospective population-based study using Medicare Benefits Schedule item 37623 to identify vasectomy procedures performed between July 2015 and December 2024. Rates were calculated per 100,000 male population using quarterly Australian Bureau of Statistics (ABS) population estimates and summarised as rolling 12-month averages. Birth rates were derived using matched ABS data for women across equivalent age strata (18-24, 25-34, 35-44 years). Results: Vasectomy rates increased nationally from 32 per 100,000 in 2016 to 55 per 100,000 in 2023 before declining modestly in 2024. Birth rates declined from 5,200 to 3,800 per 100,000 over the same period. Trends were consistent across states and age groups, with the greatest vasectomy uptake in men aged 35-44 years. Conclusion: Australia is undergoing a demographic shift characterised by rising vasectomy uptake and declining fertility. While vasectomy rates remain lower than birth rates, their convergence signals changing reproductive intentions and contraceptive behaviours. Ongoing monitoring of permanent and long-acting contraception is essential to understand evolving population dynamics and inform reproductive health policy.

5
Daily symptom monitoring is sustainable over months: retention, not compliance, is the primary barrier to long-duration digital tracking

Gunsilius, C. Z.; Pei, P.; Carayannopoulos, A.; Petzschner, F. H.

2026-06-10 rehabilitation medicine and physical therapy 10.64898/2026.06.08.26355180 medRxiv
Top 0.9%
3.6%
Show abstract

Ecological momentary assessment (EMA) enables real-time, longitudinal measurement of symptoms and behavior via smartphones, yet nearly all feasibility evidence comes from protocols lasting one to two weeks, far shorter than the timescales over which chronic diseases fluctuate and clinical decisions unfold. Whether daily compliance can be sustained over months, or whether it decays as short-protocol trends predict, is unknown. Here, 214 participants (173 with pain, 41 healthy controls) completed a 4-month (122-day) EMA protocol via the Soma smartphone app, generating 26,907 check-ins. Half the sample completed the full protocol without a two-week lapse. Aggregate compliance appeared moderate (50%), but this conflated two distinct phenomena: when recomputed over each participant's active period, compliance rose to 71%, with 91% achieving moderate-to-high adherence, and remained stable across all 17 study weeks. Pain status predicted earlier disengagement but not lower compliance among those who remained; after adjustment for differential retention, group differences disappeared. To our knowledge, this is the longest continuous daily EMA evaluation in a clinical population. It suggests the primary barrier to long-duration EMA is not declining motivation among active participants but concentrated early disengagement, with direct implications for the design of digital health protocols, decentralized trials, and remote symptom monitoring.

6
Large Language Models in Healthcare Simulation Education: A Bibliometric Analysis with AI-Assisted Screening

Pears, M.; Wadhwa, K.; Payne, S. R.; Konstantinidis, S. T. H.; Biyani, C. S.

2026-06-04 urology 10.64898/2026.06.02.26354722 medRxiv
Top 1%
3.3%
Show abstract

Large language models (LLMs) such as ChatGPT are rapidly reshaping healthcare education and simulation-based training in non-technical skills (NTS), yet no bibliometric analysis has mapped this landscape. We searched seven open-access databases (OpenAlex, PubMed, Europe PMC, Crossref, Semantic Scholar, CORE, DOAJ) for English-language publications from January 2020 to March 2026. From 100,277 initial records, a sequential keyword funnel yielded 830 candidate papers, which were screened by 83 independent Claude Sonnet 4.6 AI agents applying pre-specified inclusion criteria (PRISMA-trAIce compliant; Cohen's kappa = 0.86 pre-reconciliation, 1.0 post-reconciliation). The final AI-verified corpus comprised 551 papers with a compound annual growth rate of 109%, contributions from 2,398 authors across 279 journals in 58 countries, and an h-index of 41. ChatGPT dominated the model landscape (46% of papers), with open-source models virtually absent. Virtual patient chatbots were the leading simulation modality (106 papers). Among NTS domains, communication (145 papers) and decision-making (135 papers) were most studied, whereas teamwork, leadership, situational awareness, and crisis resource management were markedly underrepresented. Only 6 urology-relevant papers were identified, none examining LLM integration within boot camp training formats. The field is growing at extraordinary pace but remains concentrated in a narrow range of NTS domains and a single proprietary model. Critical gaps persist in team-based skills training, open-source model evaluation, and specialty-specific simulation. AI-assisted bibliometric screening using multiple independent agents is feasible, reliable, and scalable, offering a replicable methodology for mapping fast-evolving research fields.

7
Trans-ancestry genome-wide association meta-analysis of antidepressant response to selective serotonin reuptake inhibitors in clinical studies of depression

Hu, K.; Lo, C. W. H.; Awasthi, S.; Pain, O.; Singh, M.; Ahn, Y.; Aitchison, K. J.; Baune, B. T.; Biernacka, J. M.; Bondolfi, G.; Carrillo-Roa, T.; Choi, H.; Czamara, D.; Domschke, K.; Fabbri, C.; Hamilton, S. P.; Ising, M.; Jang, Y.; Kato, M.; Kim, D. K.; Kim, D.; Lee, B.-C.; Lewis, G.; Lim, S.-W.; Liu, Y.-L.; Myung, W.; Perroud, N.; Serretti, A.; Tsai, S.-J.; Uher, R.; Weinshilboum, R.; Won, H.-H.; Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, ; Ripke, S.; Coleman, J.; Lewis, C. M.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354703 medRxiv
Top 1%
2.7%
Show abstract

Antidepressants are widely prescribed for major depressive disorder, yet only one-third of patients achieve remission after initial treatment. Previous genome-wide association studies (GWAS) of clinically assessed antidepressant response combined multiple antidepressant classes, potentially obscuring class-specific effects. This study focused on selective serotonin reuptake inhibitors (SSRIs), often first-line due to better tolerability. Data from 15 cohorts across four ancestries were integrated: European (N = 3887; 11 studies), East Asian (N = 1068; 4), African (N = 277; 1), and Admixed American (N = 250; 1). GWAS of non-remission and percentage improvement were conducted within cohorts, followed by ancestry-specific meta-analyses and trans-ancestry meta-regression. Single nucleotide polymorphism (SNP)-based heritability was estimated in European samples. Polygenic scores were used for leave-one-out prediction and to assess shared genetic architecture with psychiatric traits. Gene-level and gene-set enrichment analyses were also performed. No genome-wide significant variants were identified for either outcome in any ancestry-specific or trans-ancestry analyses. However, trans-ancestry meta-regression yielded eight independent loci with suggestive associations (p < 1 x 10-5) for non-remission and 17 for percentage improvement. Gene-set analyses revealed nominal enrichment of the serotonergic synapse pathway for non-remission. SNP-based heritability estimates were not significantly different from zero for either outcome. Better SSRI response was nominally associated with lower genetic predisposition to major depressive disorder, post-traumatic stress disorder, and schizophrenia. This study represents the largest trans-ancestry GWAS of SSRI response, highlighting emerging biological signals. Limited power emphasises the need for larger and ancestrally diverse cohorts to better characterise the genetic architecture of antidepressant response.

8
Mortality in people with attention-deficit/hyperactivity disorder (ADHD): Examining how risk is embodied in a pooling of two prospective cohort studies

Li, H.; Ford, T.; Warrier, V.; Bell, S.; Batty, G. D.

2026-06-09 epidemiology 10.64898/2026.06.08.26355148 medRxiv
Top 2%
2.3%
Show abstract

Background. Nascent findings suggest that people with attention-deficit/hyperactivity disorder (ADHD) experience higher rates of mortality. To date, study samples have been insufficiently well-characterized to examine the mechanisms via which this neurodevelopmental condition elevates mortality risk. Methods. We used data from the 2007 and 2011 waves of the US National Health Interview Survey, a general population-based cohort study comprising 52097 adults (28675 women) aged 18 years or older at baseline. ADHD diagnosis and an array of demographic, socioeconomic, lifestyle, and co-morbidity (somatic and psychiatric) covariates were self-reported. Findings. At baseline, compared with unaffected individuals, participants with ADHD were more likely to be socioeconomically disadvantaged, smoke cigarettes, consume alcohol, and report symptoms of psychological distress. A median 7.75 years of mortality surveillance (range: 7.25-12.25) gave rise to 6597 deaths from all-causes. After adjustment for age, sex, ethnicity, and survey year, ADHD was associated with a markedly elevated risk of death (hazard ratio [95% confidence interval]: 1.58 [1.20-2.09]). Statistical adjustment for socioeconomic circumstances (11% attenuation), physical co-morbidities (15%), and lifestyle factors (17%) had only a modest impact on the ADHD-death gradient, with the greatest explanatory power apparent for symptoms of depression and anxiety (58%). The magnitude of the association of ADHD with mortality was commensurate to that for several well-established risk factors such as poverty (1.66 [1.55-1.78]), hypertension (1.41 [1.32-1.51]), and diabetes (1.71 [1.59-1.85]) but somewhat lower than cigarette smoking (2.51 [2.29-2.76]) after controlling for age, sex, ethnicity, and survey year. Associations between ADHD and cause-specific mortality from cardiovascular disease, cancer, and chronic respiratory disease were inconclusive. Interpretation. In the present study, the influence of ADHD on total mortality appears to be largely embodied via a series of malleable characteristics, particularly mental illness. If confirmed elsewhere, these results raise the possibility that risk factor modification via standard pharmacological and behavioral interventions could help reduce rates of premature mortality in this patient group. Funding. This paper received no direct funding. GDB is supported by the UK Medical Research Council (MR/P023444/1) and the US National Institute on Aging (1R56AG052519-01, 1R01AG052519-01A1).

9
Medical discrimination and the selective erosion of institutional health trust: evidence from the Health Information National Trends Survey 6 and 7

Park, A.; Yin, L.; Wong, A.; Lee, C.; Choi, Y.

2026-06-09 public and global health 10.64898/2026.06.06.26355057 medRxiv
Top 2%
1.7%
Show abstract

Medical discrimination may alter how patients relate to health information sources following adverse care encounters. We examined whether discrimination experience is associated with selective erosion of institutional health trust and with compensatory digital health engagement, using nationally representative data from the Health Information National Trends Survey (HINTS) 6 (2022; n=6,252) and HINTS 7 (2024; n=7,278). Survey-weighted modified Poisson regression estimated prevalence ratios (PRs) for binary high-trust outcomes, and survey-weighted ordinary least squares estimated coefficients for continuous outcomes; jackknife replicate weights (50 replicates) provided variance estimates. Discrimination was associated with substantially lower probability of high trust in the healthcare system (PR=0.39; 95% CI 0.30-0.52) and physicians (PR=0.85; 95% CI 0.77-0.94), with no significant association for trust in scientists, government, family, or religious organisations. The clinical-institutional pattern replicated in HINTS 6, which additionally showed reduced trust in scientists for race/ethnicity-based discrimination. Contrary to a disengagement hypothesis, discrimination-exposed adults showed higher probability of online health information seeking (PR=1.06), health app use (PR=1.11), and online provider messaging (PR=1.13); these associations persisted after adjustment for trust in physicians. Discrimination was independently associated with lower health self-efficacy (b=-0.271). Medical discrimination selectively erodes trust in clinical institutions while leaving broader epistemic trust largely intact. Despite this, discrimination-exposed patients engage more actively with digital health channels, consistent with compensatory reorientation toward non-clinical information sources. These findings describe engaged but institutionally alienated patients, with implications for restoring clinical trust and for equity-centred digital health design.

10
AIM-PrEP: AI-Agent Driven Multicenter Intervention to Improve PrEP Adherence and Health Monitoring Among Men Who Have Sex with Men (MSM)-Protocol of A Randomized Controlled Trial

Zeng, R.; Zuo, Z.; Yu, H.; Jin, Y.; Wang, Y.; Lv, H.; Wang, G.; Zhang, N.; He, H.; Huang, X.; Zhang, X.; Su, Q.; Xu, J.

2026-06-04 hiv aids 10.64898/2026.06.02.26354777 medRxiv
Top 2%
1.7%
Show abstract

Background: Pre-exposure prophylaxis (PrEP) has demonstrated a significant reduction in HIV infections among men who have sex with men (MSM), however, low medication adherence hinders its preventative effectiveness. Traditional approaches, such as health education and face-to-face inquiry (HEF), have demonstrated certain efficacy in improving PrEP adherence. However, these methods are resource-intensive and often plagued by delays, rendering timely and precise interventions challenging. This randomized controlled trial aims to assess the effectiveness of an intervention comprising AI-chatbot for PrEP (PrEP-bot) and Smart pillbox (SPB) (PrEP-bot-SPB) strategy to improve PrEP adherence among MSM compared to HEF.Methods and analysis: A three-arm, multicenter, open-lable RCT will be conducted with Chinese MSM [&ge;]18 years. A total of 300 participants will be recruited through three sources, including hospitals, community-based organizations (CBOs) and peer referral in five cities: Shenzhen, Beijing, Qingdao, Hangzhou and Zhengzhou. After completing baseline survey, participants will be randomized evenly into interventions or control groups: the PrEP-bot group, the PrEP-bot-SPB group, and the HEF control group. Participants in the PrEP-bot group will be granted access to an AI-chatbot agent through WeChat. This agent will: 1) generate personalized PrEP medication plans; 2) provide medication reminders and PrEP-related health check-ups notifications; 3) inquire about missed doses to deliver tailored interventions; 4) answer participant questions about PrEP using guideline-based knowledge. Participants in the PrEP-bot-SPB group will receive both the SPB and the PrEP-bot interventions. SPB could delivers medication reminders. Participants in HEF group will receive a health education pamphlet introducing PrEP and knowledge related to PrEP medication adherence at baseline and face-to-face inquiry every three months. Outcomes will be assessed for both short-term and medium-to-long-term effects. The primary objective is the effectiveness in improving PrEP adherence measured by self-report, Eight-Item Morisky medication adherence scale (MMAS-8) and concentration of Tenofovir in dried blood spots (DBS) (PrEP adherence [&ge;]90%) at 3 months follow-up. Secondary outcomes include: 1) effectiveness in preventing HIV infection measured by HIV-self test (HIVST); 2) effectiveness of PrEP-related health check-ups; 3) the effectiveness, feasibility, acceptability, and user satisfaction with the PrEP-bot; 4) effectiveness in improving PrEP adherence at 6-month, 9-month and 12-month follow-up periods. All participants will receive quarterly follow-up visits during the 12-month study period. Intention-to-treat analysis and per protocol set (PPS) analysis will be used.Results: Recruitment and enrollment of participants began in January 2026 and is currently ongoing.Discussion: This study is expected to establish a novel AI-based intervention model for PrEP, providing innovative strategies for HIV control among MSM populations. If the PrEP-bot is proven non-inferior to HEF, it could offer users real-time, precise, and personalized interventions while simultaneously addressing PrEP-related inquiries and health check-ups reminders. Importantly, this approach would achieve significant reductions in resource requirements for implementation and maintenance and be more cost-effective. With the ongoing advancement of AI technologies, PrEP-bot holds substantial promise for widespread implementation in PrEP adherence, potentially revolutionizing HIV prevention for MSM in China through this innovative intervention modality.Trial registration: ChiCTR2500111280 (Chinese Clinical Trial Registry). Date of registration: 29 October 2025.

11
A canary in the mind: A single baseline brain scan predicts adolescent depression and anxiety one year later

Deco, G.; Sanz Perl, Y.; Vohryzek, J.; Garcia-Guzman, E.; Pizzagalli, D. A.; Laukkonen, R.; Chandaria, S.; Kringelbach, M. L.

2026-06-10 psychiatry and clinical psychology 10.64898/2026.06.08.26355206 medRxiv
Top 2%
1.7%
Show abstract

Mood and anxiety disorders emerge predominantly in adolescence, yet they are usually identified only once symptoms have consolidated, when intervention can only be reactive. A marker that registers the loss of healthy brain function before symptoms crystallise would allow earlier and more targeted treatment, much as caged canaries once warned miners of danger before it became apparent. Here we report such a marker using a single baseline resting-state functional MRI scan in 150 adolescents in the Human Connectome Project Boston Adolescent Neuroimaging of Depression and Anxiety (HCP BANDA) cohort, allowing us to prospectively predict depression and anxiety symptoms one year later in held-out participants at r = 0.60, substantially above the effect-size ceiling reported for functional connectivity in the same data. The marker is not computed from raw functional connectivity but read out from a whole-brain generative model fitted to each individual's dynamics, which gives access to interference structure that covariance-based features cannot represent. The regions driving the prediction, including precuneus, ventromedial prefrontal and anterior cingulate cortices, are among those previously implicated in internalising disorders, and the same signature tracks cognitive variation in healthy participants and is mechanistically linked to the efficiency of task-related computation. These findings establish a mechanistically interpretable and prospectively predictive marker of adolescent mental health and define a clear path towards external validation and clinical use.

12
Prioritizing embryos with lower homozygosity may reduce disease risk in children of related individuals undergoing preimplantation genetic testing

Wolfram, T.; Ahangari, M.; Davidson, I.; Wartschinski, L.; Li, J. H.; Eyre, M.; Stern, D.; Schleede, J.; Haghighi, A.; Carmi, S.; Christensen, M.

2026-06-04 genetic and genomic medicine 10.64898/2026.05.30.26354526 medRxiv
Top 3%
1.5%
Show abstract

Consanguinity is a reproductive union between individuals who share a recent common ancestor. These unions are common in many regions of the world and increase the burden of rare recessive disorders by elevating autozygosity in offspring. Current reproductive genetic screening focuses on a limited set of known pathogenic variants, leaving most recessive risk unaddressed. Here we argue that embryo-level autozygosity, quantified as the fraction of the genome in long runs of homozygosity (FROH), is a potentially actionable genomic biomarker that can be integrated into routine preimplantation genetic testing as a homozygosity-informed embryo-prioritization framework (PGT-H) that can be layered onto existing embryo biopsy workflows when couples are already undergoing IVF with PGT-A or PGT-M. Using forward simulations of first-cousin and double-first-cousin couples, we show that siblings conceived by the same couple span a wide range of FROH; selecting the lowest-FROH candidate from a cohort of five embryos reduces FROH by approximately 40% on average. Combining these reductions with empirical effect-size estimates, we estimate that for first-cousin couples this strategy could reduce risk of intellectual disability by roughly 35-45% (corresponding to an absolute risk reduction of about 1.8-2.2%) and potentially reduce excess recessive disease burden, while also modestly reducing risk of common diseases such as type 2 diabetes. We outline how existing PGT-A and PGT-M workflows could potentially be extended to report embryo-level FROH and discuss ethical and counseling considerations. Autozygosity-based embryo prioritization offers a principled way to address a component of recessive risk that current variant-centric approaches miss.

13
Metatranscriptomics-Derived Disease Risk Scores as a Preventive, Diagnostic, and Treatment Support Tool

Hu, L.; Bass, M.; Patridge, E.; Molusky, M.; Antoine, G.; Vuyisich, M.; Banavar, G.

2026-06-06 genetic and genomic medicine 10.64898/2026.05.29.26354333 medRxiv
Top 3%
1.4%
Show abstract

Background: Chronic diseases and symptom syndromes often develop after prolonged biological changes that may precede formal diagnosis. RNA-based metatranscriptomics captures active microbial and human gene expression and may provide a functional layer for disease risk evaluation. To address this translational gap, we developed and validated a Disease Risk Score (DRS) framework that integrates metatranscriptome-derived pathway activity scores from stool, saliva, and blood samples, and evaluated its potential clinical utility as an adjunct risk-evaluation tool. Methods: DRS uses disease-specific sets of pathway activity scores derived from stool and saliva microbial functions, stool and saliva microbial taxa, and blood human gene expression. For each disease, 'not optimal' pathway scores are aggregated into a normalized cumulative odds ratio, or cOR, using score-level odds ratios, statistical significance, and literature-supported biological relevance derived from a Development Cohort of 22,369 individuals. A cOR [&ge;] 5 is defined as high risk. Performance is evaluated in an independent Validation Cohort of 15,908 individuals using self-reported diseases as the reference. Disease support requires both significant cOR separation between self-reported and not-reported (Cohen's d [&ge;] 0.2) and risk ratio enrichment of self-reported disease among individuals classified as high risk (95% CI of Risk Ratio > 1). Results: Of 20 initially evaluated diseases, 15 meet the prespecified validation criteria on the independent validation cohort: ADHD, anxiety, chronic fatigue syndrome, depression, GERD, hypertension, inflammatory bowel disease, IBS-C, IBS-D, insomnia, MASLD, obesity, obstructive sleep apnea, Sjogren's syndrome, and type 2 diabetes. Five selected clinical scenarios illustrate how DRS can support clinician-mediated decision making, including IBS subtype reclassification, improved diagnostic acceptance in IBS-D, personalized lifestyle counseling in MASLD and early type 2 diabetes, and diagnostic uncertainty in atypical GERD. Conclusions: DRS is a metatranscriptomics-based risk-stratification framework that aggregates active microbial and human pathway signals into interpretable disease-specific risk estimates across a wide range of disease conditions. Validation against self-reported disease labels in an independent cohort shows significant risk enrichment for each of 15 diseases. DRS is intended as an adjunct to clinical evaluation: a decision support tool in situations where routine care encounters uncertainty, delay, or low patient engagement. Future prospective studies using clinically adjudicated endpoints are needed to assess calibration and clinical outcomes.

14
Universal Periodic Review recommendations and trajectories of maternal health between 2005 and 2023: a longitudinal ecological analysis of 89 countries

Uppal, A.; Thomas, R.; De Pasquale, M.; Sillo, J.; Getahun, H.

2026-06-05 public and global health 10.64898/2026.06.03.26354800 medRxiv
Top 3%
1.3%
Show abstract

Background: The Universal Periodic Review (UPR) is a peer-review mechanism established to hold UN Member States accountable for human rights including the right to health, yet evidence on its impact on health outcomes is limited. We evaluated whether UPR engagement is associated with accelerated improvements in maternal health trajectories. Methods and Findings: We conducted a longitudinal ecological analysis of 89 countries with a baseline maternal mortality ratio (MMR) of 70 or greater per 100,000 live births in 2005. Outcomes were trajectories of annual MMR, skilled birth attendance (SBA), and contraceptive prevalence rate (CPR), from 2005 to 2023. The exposure was the volume of health-related UPR recommendations received across three cycles, thematically classified using a validated rule-based algorithm. Mixed-effects models adjusted for time-varying GDP per capita and historical fragility. The 89 countries received 41,733 UPR recommendations across three cycles, of which 405 (1%) were related to maternal health. Maternal health recommendations were preferentially directed at countries with higher baseline MMR and lower SBA. After adjustment, each additional maternal health recommendation was associated with a 0.24% [95% confidence interval (CI): 0.08, 0.40] faster annual reduction in MMR, a 0.52% [0.12, 0.91] faster annual gain in the odds of SBA, and a 0.21% [0.09, 0.34] faster annual gain in the odds of CPR. Broader recommendations on women's health and health systems and services were also associated with faster annual improvements in trajectories across all three outcomes; recommendations on abortion, family planning, sexual health and wellbeing, and sexual education tended to be directed towards lower-burden countries and were not associated with differences in any trajectories. It is important to note that the ecological design precludes causal inference. Conclusions: Receiving UPR recommendations on the themes of maternal health, womens health, and health systems and services are associated with accelerated improvements in maternal health trajectories among high-burden countries. These findings suggest that international human rights accountability mechanisms may have a role in supporting national progress on maternal health.

15
Placental molecular subtypes of severe preeclampsia reveal divergent aging trajectories and fetal growth outcomes

Du, Y.; Benny, P. A.; Lahiri, S.; AlAkwaa, F. M.; Huang, Q.; Liu, Y.; Lassiter, C. B.; Astern, J.; Riel, J.; Garmire, L. X.

2026-06-04 sexual and reproductive health 10.64898/2026.06.02.26354756 medRxiv
Top 3%
1.3%
Show abstract

Severe preeclampsia (sPE) is a major cause of maternal and fetal morbidity worldwide, yet its placental molecular heterogeneity remains poorly defined by current clinical diagnosis. To resolve the molecular architecture of sPE, here we integrated DNA methylation and proteomic profiling from a multi-ethnical cohort of 444 placentas from the Hawaiian Biorepository (HiBR), including 169 sPE cases, matched preterm controls and full-term controls. To address cellular heterogeneity in bulk placental tissue, we developed HOMED (Hierarchically Optimized Methylation Deconvolution), a single-cell-guided hierarchical framework for inferring placental cell-type composition from DNA methylation data. HOMED-adjusted integrative analyses identified extensive subtype-specific alterations involving hypoxia, angiogenesis, immune activation, trophoblast differentiation and metabolic remodeling. Molecular stratification revealed two reproducible sPE subtypes with divergent placental aging trajectories. One subtype exhibited a pre-mature placental state marked by accelerated placental aging, whereas the other displayed slower accelerated placental aging but a substantially increased risk of small-for-gestational-age birth (P = 0.028). These subtypes were independently replicated across six external cohorts and further supported by proteomic signatures achieving a classification accuracy of 0.88. Integrative epigenomic and proteomic analyses linked the growth-restricted subtype to hypoxia-associated glycolytic remodeling, suggesting distinct pathogenic mechanisms underlying clinically diagnosed sPE. Together, our findings redefine severe preeclampsia as a biologically heterogeneous placental disorder composed of molecularly distinct subtypes with divergent aging trajectories and fetal growth outcomes, providing a framework for mechanism-based stratification and precision obstetric medicine.

16
STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction

Chen, T.; Li, X.; Mazumder, R.; Zhang, H.; Lin, X.

2026-06-09 genetic and genomic medicine 10.64898/2026.06.07.26355109 medRxiv
Top 3%
1.2%
Show abstract

Whole-exome and whole-genome sequencing technology has enabled the discovery of rare genetic variants associated with human health and diseases. However, existing statistical methods used for rare variant association testing are not well-suited for building genetic risk prediction models that jointly incorporate rare and common variants. We propose STELLAR, a flexible ensemble learning-based approach to compute rare variant polygenic risk scores (PRS) using association summary statistics to enhance conventional common variant PRS. Our method combines burden-based and penalty-based rare variant analysis and leverages functional annotation information to prioritize potentially causal variants within the prediction models. In simulation studies, PRS using STELLAR consistently showed the highest prediction accuracy compared to models using common variants alone or rare variant burdens. Applied to UK Biobank whole-exome sequencing data (n=310,831) across eight continuous and five binary traits, STELLAR significantly improved prediction accuracy, refined stratification of individuals at the highest genetic risk beyond common variants, and prioritized biologically relevant genes. STELLAR provides a scalable strategy to incorporate rare variants into PRS in addition to common variants, advancing precision risk prediction and enabling more comprehensive assessment of genetic contributions to complex diseases.

17
Parental educational attainment polygenic scores contribute to phenotypic heterogeneity in offspring with autism

Gao, S.; Sui, Y.; Tian, P.; Rao, X.; Yan, C.; Xu, Y.; Wang, T.

2026-06-08 genetic and genomic medicine 10.64898/2026.06.03.26354779 medRxiv
Top 3%
1.2%
Show abstract

Educational attainment-related polygenic scores have been implicated in autism spectrum disorder (ASD), but how parental polygenic scores shape offspring phenotypes remains unclear. Using genotyping and exome-sequencing data from 142,357 individuals (55,252 ASD cases) in a large ASD cohort, we dissected the direct and indirect genetic effects of educational attainment-related polygenic scores on ASD phenotypes. Trio-model analyses showed that parental polygenic scores for educational attainment (PGSEA ) were associated with milder core ASD symptoms, including social deficits and repetitive behaviors, predominantly through indirect genetic effects, whereas their associations with comorbidities were driven predominantly by direct genetic effects. PGSEA was also significantly negatively associated with rare variant burden and prenatal factors, although these factors contributed largely independently to most phenotypes. Adjustment for full-scale intelligence quotient (FSIQ) and socioeconomic status (SES) partially attenuated the indirect effects of PGSEA on offspring phenotypes. Finally, higher parental PGSEA was associated with later age at diagnosis in offspring, partly through its protective effects on ASD phenotypes. These findings indicate that indirect genetic effects of parentalPGSEA contribute substantially to phenotypic variation in ASD and highlight family-mediated pathways as an important component of ASD heterogeneity.

18
A risk-of-contagion index using a Bayesian based model for the COVID-19 epidemic in Mexico

Corona-Moreno, R.; Acuna-Zegarra, M. A.; Santana-Cibrian, M.; Velasco-Hernandez, J. X.

2026-06-10 health policy 10.64898/2026.06.09.26355274 medRxiv
Top 4%
0.8%
Show abstract

During the COVID-19 pandemic, limited testing capacity and reporting delays complicated epidemic surveillance and decision-making in Mexico. We calibrated \textit{covidestim}, a Bayesian nowcasting model, to estimate the total SARS-CoV-2 infections from reported cases and deaths using Mexican surveillance data. Disease-progression distribution priors were calibrated using Mexico City records and validated through comparisons with national seroprevalence surveys, hospitalization data, and annual reported severe-case rates across all states. Using the reconstructed estimates of active infections, we implemented an event-based risk framework that quantifies the probability of encountering at least one infectious individual in gatherings of different sizes. This probability was subsequently translated into a four-level epidemiological traffic-light indicator and computed at both state and municipality levels. The resulting estimates revealed substantial spatial heterogeneity that is obscured by state-level aggregation, particularly in states with marked differences between urban and rural municipalities. To evaluate consistency with public-health indicators, we compared the proposed risk classification with the official Mexican epidemiological traffic-light system, considering interpretable gathering sizes relevant to public-health decision making. Weekly reports derived from this framework were delivered to policymakers in the State of Queretaro in Mexico, as an anticipation tool for school reopening and public-space management. This demonstrates that this Bayesian reconstruction of infections combined with event-based risk metrics can provide an interpretable and generalizable municipality-level complement to routine surveillance systems, particularly in regions with limited testing capacity and heterogeneous local transmission dynamics.

19
Heart Rate Circadian Oscillations as Digital Biomarkers of Cardiometabolic Health Determinants

Colitta, A.; Bruno, S.; Benedetti, D.; Hoxhaj, D.; Cruz-Sanabria, F.; Di Pede, C.; Buracchi Torresi, F.; Frumento, P.; Gargani, L.; Fabbrini, M.; Maestri Tassoni, M.; Bonanni, E.; Faraguna, U.

2026-06-10 cardiovascular medicine 10.64898/2026.06.07.26355124 medRxiv
Top 4%
0.8%
Show abstract

AIMS Cardiometabolic risk factors may impair health by altering the autonomic modulation of the cardiovascular system, a physiological process described by heart rate (HR) circadian oscillations. However, the impact of cardiometabolic health determinants on HR circadian oscillations remains scarcely characterized in real-world, population-based settings. To address this, we applied digital health technologies to investigate how cardiometabolic health determinants shape HR circadian oscillations in a real-world cohort of individuals free of cardiometabolic diseases. METHODS First, a 10-fold cross-validation of a model was performed, aiming at mitigating wearables measurement error caused by motion artifacts. This process was informed by 10,056 epochs of concurrent wearable-derived and polysomnographic HR assessment, yielding an average 1.3 bpm reduction in wearables measurement error. We subsequently applied this model to over 2 million 1-minute epochs of HR data, derived from 7-day continuous actigraphic recordings of 245 individuals free of cardiometabolic disorders. Functional-on-scalar regression modelling and both parametric and nonparametric analyses characterized HR circadian profiles and their relationships with demographics, lifestyle, chronotype, sleep health, and chronic insomnia diagnosis. A 6-dimension sleep health index was calculated. RESULTS Sex, chronotype, and sleep health predominantly shaped HR circadian oscillations. In detail, females consistently showed higher HR across the 24 hours. Moreover, chronotype was associated to a phase shift in HR circadian profiles, with later timings corresponding to eveningness. Notably, sleep health impacted HR circadian oscillations in a dose-dependent fashion: each additional impaired sleep dimension was associated with a 1.2 bpm HR increase during nighttime, alongside reduced circadian robustness and delayed oscillation timings. Finally, the earlier occurrence of morning HR peaks served as a digital biomarker of insomnia (80% specificity, 74% sensitivity). CONCLUSIONS This work provides a digital health framework to characterize HR circadian oscillations in free-living populations and supports its clinical utility in capturing the autonomic disruptions related to cardiometabolic health determinants.

20
EXHEART: A Fairness-Aware Explainable Stacked Ensemble for Cardiovascular Disease Classification with Cross-Instrument Disparity Attribution

Biswas, M. A.; Laila, A.

2026-06-05 health informatics 10.64898/2026.06.03.26354879 medRxiv
Top 5%
0.7%
Show abstract

Background: Machine learning models trained on population health surveys offer scalable tools for cardiovascular screening, but recurring methodological weaknesses undermine their credibility and equity: data leakage from synthetic oversampling, qualitative rather than quantitative explainability evaluation, and the absence of demographic fairness auditing at the clinical operating threshold. Methods: We present EXHEART, a leakage-free stacked ensemble pipeline trained on BRFSS 2015 (n = 253,680) and validated on BRFSS 2020 (n = 319,795; temporal transport and retrain) and a clinical cardiovascular examination dataset (n = 68,730). The pipeline combines XGBoost, LightGBM, Random Forest, and a multi-layer perceptron as base learners with 5-fold out-of-fold logistic regression stacking and Platt scaling calibration. A quantitative SHAP-LIME consistency framework, based on Kendall-tau rank correlation and Jaccard overlap, accompanies a decision-curve analysis, a subgroup-stratified SHAP interaction analysis, and an intersectional fairness audit (Sex x Age x Income) with threshold-shifting mitigation and a frontier of the fairness-utility trade-off. The framework also adds cross-instrument fairness-disparity attribution, an empirical diagnostic that provides evidence on whether an observed subgroup disparity is more consistent with a measurement-induced or a substantive explanation by re-validating it on a dataset that measures the same clinical construct objectively. On heart disease, this diagnostic associates 89% of the sex TPR gap (95% CI [0.65, 0.99]) with the self-reported survey outcome rather than with a substantive risk difference. Results: On BRFSS 2015, EXHEART achieves AUC-ROC = 0.850, AUPRC = 0.371, Brier score = 0.071, and reduces ECE by 96% (0.256 to 0.011) via Platt scaling. Global SHAP-LIME rank agreement is moderate-to-strong (Kendall-tau = 0.580, Spearman-rho = 0.818) with a substantial top-3 divergence (Jaccard@3 = 0.200), where Stroke flips from SHAP rank 8 to LIME rank 1. The Sex TPR gap is 0.124 at the screening threshold; intersectional Sex x Age disparities reach 0.649 among adequately-powered cells, 5.2x the single-attribute gap. Temporal transport to BRFSS 2020 collapses sensitivity from 0.776 to 0.267, while retraining restores AUC = 0.840 and ECE = 0.012. On clinical examination data, the Sex TPR gap collapses to 0.014; the attribution test indicates this gap is instrument-dependent, consistent with a measurement or outcome-definition explanation rather than a substantive risk difference. Cross-domain SHAP analysis identifies four instrument-independent CVD risk factors and two major portability failures. Conclusions: EXHEART combines three practices that population-scale cardiovascular classifiers usually apply in isolation: leakage-free training with calibrated probabilities, a test of whether the model's explanations are stable, and a fairness audit that examines intersecting subgroups rather than single attributes. Bringing them together proved worthwhile. The intersectional audit revealed disparities that single-attribute auditing missed, and the cross-instrument comparison indicated that much of the sex gap reflects how the outcome is measured in survey data rather than a substantive difference in risk. The temporal transport findings indicate that deployed BRFSS models warrant periodic monitoring and retraining to maintain clinical utility. EXHEART is a retrospective methodological evaluation on public de-identified data; it is not validated for direct clinical decision-making, diagnosis, or treatment recommendation without prospective clinical validation.